Song authorship attribution: a lyrics and rhyme based approach
نویسندگان
چکیده
Abstract In this work, we apply authorship attribution to a large-scale corpus of song lyrics. As sub-category poetry, lyrics embody cultural elements as well stylistic attributes that are not present in prose. We draw attention special characteristics such repetitive sound patterns and rhyme based structures can be key ownership, opportunities cannot employed for other types text tweets, emails, blog posts. first create new balanced, data set 12,000 from 120 different artists. propose CNN models on lyric set, order use structural information included the lyrics, similarly image classification. conduct experiments at character sub-word levels mostly reflect positional information. addition, phoneme level features, which intrinsically involve repetitions, rhyme, meter, represent unique verse-based textual compositions. attempt discover idiosyncratic features consequently author genre associations by working with variants architectures have been successfully used classification domains. Our architecture choice results particular focus residing neighboring regions, since CNNs fail apprehend long term dependencies. Finally, empirically evaluate our comparison findings previous test research
منابع مشابه
Artist Attribution via Song Lyrics
Song lyrics, separated from the audio signal of their song, still contain a significant amount of information. Mood and meaning can still be conveyed effectively by a pure textual representation. There has even been somewhat successful previous work on genre classification from song lyrics[7]. Building on previous work, we seek to build an artist attribution system for song lyrics. This task is...
متن کاملA Web-Based Self-training Approach for Authorship Attribution
As any other text categorization task, authorship attribution requires a large number of training examples. These examples, which are easily obtained for most of the tasks, are particularly difficult to obtain for this case. Based on this fact, in this paper we investigate the possibility of using Webbased text mining methods for the identification of the author of a given poem. In particular, ...
متن کاملRhyme and Style Features for Musical Genre Classification by Song Lyrics
How individuals perceive music is influenced by many different factors. The audible part of a piece of music, its sound, does for sure contribute, but is only one aspect to be taken into account. Cultural information influences how we experience music, as does the songs’ text and its sound. Next to symbolic and audio based music information retrieval, which focus on the sound of music, song lyr...
متن کاملAuthorship Attribution
Authorship attribution, the science of inferring characteristics of the author from the characteristics of documents written by that author, is a problem with a long history and a wide range of application. Recent work in “non-traditional” authorship attribution demonstrates the practicality of automatically analyzing documents based on authorial style, but the state of the art is confusing. An...
متن کاملMaximal Repeats Enhance Substring-based Authorship Attribution
This article tackles the Authorship Attribution task according to the language independence issue. We propose an alternative of variable length character n-grams features in supervised methods: maximal repeats in strings. When character ngrams are by essence redundant, maximal repeats are a condensed way to represent any substring of a corpus. Our experiments show that the redundant aspect of n...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Digital Humanities
سال: 2022
ISSN: ['2524-7832', '2524-7840']
DOI: https://doi.org/10.1007/s42803-022-00050-x